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ABSTRACT. Random matrices arise in many mathematical contexts, and it is natural to ask about 
the properties that such matrices satisfy. If we choose a matrix with integer entries at random, for 
example, what is the probability that it will have a particular integer as an eigenvalue, or an integer 
eigenvalue at all? If we choose a matrix with real entries at random, what is the probability that 
it will have a real eigenvalue in a particular interval? The purpose of this paper is to resolve these 
questions, once they are made suitably precise, in the setting of 2 x 2 matrices. 



1. Introduction 

Random matrices arise in many mathematical contexts, and it is natural to ask about the proper- 
ties that such matrices satisfy. If we choose a matrix with integer entries at random, for example, 
we would like to know the probability that it has a particular integer as an eigenvalue, or an integer 
eigenvalue at all. Similarly, if we choose a matrix with real entries at random, we would like to 
know the probability that it has a real eigenvalue in a particular interval. Certainly the answer 
depends on the probability distribution from which the matrix entries are drawn. 

In this paper, we are primarily concerned with uniform distribution, so for both integer- valued 
and real- valued cases we must restrict the entries to a bounded interval. In an earlier paper J6l, 
the authors show that random n x n matrices of integers almost never have integer eigenvalues. 
An explicit calculation by Hetzel, Liew, and Morrison fl4) shows that a 2 x 2 matrix with entries 
independently chosen uniformly from [—1,1] has real eigenvalues with probability 49/72. This 
calculation gives hope that our more precise questions about eigenvalues of a particular size might 
be accessible in the 2x2 setting. The purpose of this paper is to resolve these questions, once we 
make them suitably precise. 

For an integer k > 1, let M. 2 (k) denote the uniform probability space of 2 x 2 matrices of 
integers with absolute value at most k. Note that \M 2 {k)\ = (2k + l) 4 = 16fc 4 + 0(k 3 ). We will 
obtain exact asymptotics for the number of matrices in Ai 2 {k) having integer eigenvalues and, 
more precisely, for the number of matrices with a given integer eigenvalue A. 

For any integer A, define 

M 2 {k) = {M G M 2 {k) : A is an eigenvalue of M}, 

and let 

M 2 L (k) = \jM x 2 (k). 
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Theorem 1. Define the function V : [-2, 2] — > R by V(—6) = V(8) and 



V(6) 



'A- 25 - 5 2 + 5 2 log(l + 5) - 2(1 - 5) log(l - 5), i/0 < 5 < 1, 

l + log2, if5 = l, 

A- 25 - 5 2 + 5 2 \og(6 + 1) + 2(5 - 1) log(5 - 1), ifl<5<V2, 

5 2 - 25 - {5 2 - 25 + 2) log (5 - 1) ify/2<6<2 



(1) 



(where log w ?/ze natural logarithm). Then for any integer A between —2k and 2k, 

\M x 2 (k)\ = 2W{ n X J k) k 2 \o g k + 0(k 2 ), (2) 

where the implied constant is absolute. On the other hand, if\X\ > 2k then Jid^ik) is empty. 

We remark that the function V(6) is continuous and, with the exception of the points of infinite 
slope at 5 = ±1, differentiable everywhere (even at 5 = ±2, if we imagine that V(5) is defined to 
be when \5\ > 2). Notice that equation ([2]) is technically not an asymptotic formula when A is 
extremely close to ±2k, because then the value of V(X/k) can have order of magnitude 1/ log A; 
or smaller, making the "main term" no bigger than the error term. However, equation is truly 
an asymptotic formula for |A| < 2k — ip(k)k/(log k) 1 ^ 3 , where ip(k) is any function tending to 
infinity (the exponent 1/3 arises because V(5) approaches cubically as 5 tends to 2 from below). 

By summing the formula ([2]) over all possible values of A, we obtain an asymptotic formula for 
\M% (k)\. We defer the details of the proof to Section[3j 

Corollary 2. Let C = {7^2 + 4 + 31og( v / 2 + 1))/3tt 2 w 0.55873957. The probability that a 
randomly chosen matrix in Ai 2 (k) has integer eigenvalues is asymptotically C (log k)/k. More 
precisely, 

\M%{k)\ = 16Ck 3 logk + 0{k 3 ). 



If M G M.2 (k) has eigenvalue A, then the scaled matrix k~ l M has eigenvalue X/k, which is the 
argument of V that appears on the right-hand side of ([2]). Thus one interpretation of Theorem[T]is 
that for large k, the rational eigenvalues of k~ l M tend to be distributed like the function V. 

Note that the entries of k~ x M are sampled uniformly from a discrete, evenly-spaced subset of 
[—1,1]. As k — > oo this probability distribution converges in law to the uniform distribution on 
the interval [—1, 1]. Let M.2{\— 1, 1]) denote the probability space of all 2 x 2 matrices whose 
entries are independent random variables drawn from this distribution. One might ask whether the 
distribution given by Theorem[T]is just a discrete approximation to the distribution of eigenvalues 
in M.2{[— 1, 1]); the answer, perhaps surprisingly, is no. The next theorem provides this latter 
distribution. 

Theorem 3. Define W(5) to be the density function for real eigenvalues of matrices in A^2([— 1> 1])." 
if M is a randomly chosen matrix from M.2{[— 1, 1]), then the expected number of eigenvalues of 
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M in the interval [s, t] is f* W{6) dS. Then W(-6) = W(6) and 

' (80 + 205 + 905 2 + 525 3 - 1075 4 )/(144(1 + 6)) 



"(5- 


76 + 86 2 )(l-6) log(l -6)/12 




-6(1- 


-5 2 )log(l + 5)/4, 


i/0 < 6 < 1, 


6{20 + 106 - 


- 126 2 - 36 3 )/(lQ(l + 6)) 




+ (36- 


-l)(<5-l)log(«J-l)/4 




+ 6(6 2 


-l)log(<f + l)/4, 


*/l < 5 < V2, 


6(6 -2)(2- 


-Q6 + 36 2 )/(lQ(6 -1)) 




-{s- 


l) 3 log(5-l)/4, 


ifV2 <5<2, 



(0, if6>2. 



As in the case of V(6), the function W(6) is continuous and differentiable everywhere, with 
the exception of the points of infinite slope at 6 = ±1. (The value W(l) = 15/32 makes the 
function continuous there, although the value of a density function at a single point is irrelevant 
to the probability distribution.) It also shares the same cubic decay as 6 tends to 2 from below. 
However, there are obvious qualitative differences between the functions V and W. In Figure [T] 
we plot V and W on the same axes, normalized so that the area under each is 2 (these normalized 
versions are denoted U Ij and U R in our earlier paper [6]). In the case of A4 2 (k), this normaliza- 
tion corresponds to conditioning on having integer eigenvalues, that is, scaling by the probability 
C(\ogk)/k from Corollary [5j For M 2 ([-l, 1]) we are conditioning on having real eigenvalues, 
which occurs with probability 49/72 (this can be obtained by integrating W(6), analogously to the 
proof of Corollary [2j the computation by Hetzel, Liew, and Morrison [4J is more direct). 

Note that the distribution W(6) is bimodal, having its maxima at 6 ~ ±0.75030751. Thus, a 
random matrix in M.2([— 1, 1]) is more likely to have an eigenvalue of magnitude near 3/4 than 
one of magnitude near 0. We expect this would still hold if we were to condition on matrices in 




M.2{[— 1, 1]) having rational eigenvalues, since any matrix with real eigenvalues is a small pertur- 
bation from one with rational eigenvalues. That this is not true for V(5) shows that the eigenvalue 
distribution of M. 2 (k) is not purely the result of magnitude considerations but also encodes some 
of the arithmetic structure of the integers up to A;. 

We remark that Theorem [T] can also be obtained from a powerful result of Katznelson [5]. Let 
B be a convex body containing the origin in IR 4 , and embed the set of 2 x 2 integer matrices 
(? j) as lattice points (a,b,c,d) E Z 4 . Then Theorem 1 of [|5l gives an asymptotic formula for 
the number of singular integer matrices inside the dilate k ■ B. Taking B = [— 1, l] 4 then yields 
an asymptotic formula for \M 2 {k)\, and more generally one can obtain \M 2 (k)\ by adding and 
subtracting appropriate shifts of B. The asymptotic formula in [5] is defined in terms of an unusual 
singular measure supported on the Zariski-closed subset of IR 4 corresponding to singular matrices. 
The explicit computation of this measure is roughly analogous to our case-by-case considerations 
in Section |4j modulo the significant complications of carrying error terms. Our techniques are 
more elementary, but Katznelson's results apply in theory to matrices of any size, whereas our 
methods become unwieldy even for 3 x 3 matrices. 

In the case ofnxn matrices with entries independently chosen from a Gaussian distribution, a 
great deal more is known. Edelman [1J has computed the exact distributions of the real and com- 
plex eigenvalues for any n, as well as the number of real eigenvalues (for instance, the probability 
of having all n eigenvalues real is precisely 2~™( n ~ 1 )/ 4 ). As n — * oo, the real eigenvalues, suit- 
ably rescaled by a factor of converge to the uniform distribution on [—1,1]. Similarly, the 
complex eigenvalues converge to the "circular law" predicted by Girko [3|, namely the uniform 
distribution on the unit disk centered at the origin. Very recently, Tao and Vu [8| have shown that 
the circular law is universal: one can replace the Gaussian distribution by an arbitrary distribution 
with mean and variance 1. Similar results have been established for random symmetric matrices 
with entries independently chosen from a Gaussian distribution (the "Wigner law") or from other 
distributions. 

Those who are interested in the connections between analytic number theory and random matrix 
theory might wonder whether those connections are related to the present paper. The matrices 
in that context, however, are selected from classical matrix groups, such as the group of n x n 
Hermitian matrices, randomly according to the Haar measures on the groups. The relationship to 
our results is therefore minimal. 



2. Preliminaries about matrices 

We begin with some elementary observations about 2x2 matrices that will simplify our com- 
putations. The first lemma explains why the functions V and W are supported only on [—2, 2]. 

Lemma 4. Any eigenvalue of a matrix in M. 2 (k) is bounded in absolute value by 2k. Any eigen- 
value of a matrix in A^Q— 1, 1]) is bounded in absolute value by 2. 

Proof. We invoke Gershgorin's "circle theorem" O, a standard result in spectral theory: let M = 
(rriij) be an n x n matrix, and let D(z, r) denote the disk of radius r around the complex number 
z. Then Gershgorin's theorem says that all of the eigenvalues of M must lie in the union of the 
disks 

D(m Ul l m ijl)> D(m 22 , ^ \ m 2j\), D(m nn , ^ l^nilj- 

l<?'<n ' ^ l<j<n ' ^ l<j<n ' 

4 



In particular, if all of the entries of M are bounded in absolute value by B, then all the eigenvalues 
are bounded in absolute value by nB. □ 

The key to the precise enumeration of jM 2 (k) is the simple structure of singular integer matrices: 

Lemma 5. For any singular matrix M e M. 2 {k), either at least two entries of M equal zero, or 
else there exist nonzero integers a, b, c, d with (a, b) = 1 such that 

ac be 



Moreover, this representation of M is unique up to replacing each of a, b, c, d by its negative. 

Proof. If one of the entries of M equals zero, then a second one must equal zero as well for the 
determinant to vanish. Otherwise, given 



M 



mn rn X2 
m 21 m 2 2 



with none of the equal to zero, define c = (mn, mi 2 ), and set a = mn/c and b = m\ 2 /c, so 
that (a, b) — 1. Since M is singular, the second row of m must be a multiple of the first row — that 
is, there exists a real number d such that m 2 i = ad and m 22 = bd. Since a and b are relatively 
prime, moreover, d must in fact be an integer. 

This argument shows that every such matrix has one such representation. If 

'a'c' 6'c' N 



l/ ~ \a'd' b'd' 

is another such representation, then (a', b') = 1 implies (a'c', b'd) = \c'\, which shows that \d\ = c; 
the equalities \a'\ = \a\, \b'\ = \b\, and \d'\ = \d\ follow quickly. □ 

For a 2 x 2 matrix M = we define discM = (trM) 2 - 4det M = (a - df + 46c. It 

is easily seen that this is the discriminant of the characteristic polynomial of M. We record the 
following elementary facts, which will be useful in the proof of Lemma[7]and Proposition [13} 

Lemma 6. Let M be a 2 x 2 matrix with real entries. 

(a) M has repeated eigenvalues if and only if disc M = 0. 

(b) M has real eigenvalues if and only if disc M > 0. 

(c) det M < if and only if M has two real eigenvalues of opposite sign. 

(d) If det M > and disc M > 0, then the eigenvalues of M have the same sign as tr M. 

Proof. Let A 1; A 2 denote the eigenvalues of M, so that trM = A x + A 2 , detM = AiA 2 , and 
discM = (Ai — A 2 ) 2 , each of which is real. Parts (a), (b) and (d) follow immediately from these 
observations, and part (c) from the fact that A 2 = Ai if Ai, A 2 are complex. □ 

The next lemma gives a bound for the probability of a matrix having repeated eigenvalues. It 
is natural to expect this probability to converge to as k increases, and indeed such a result was 
obtained in H for matrices of arbitrary size. We give a simple proof of a stronger bound for the 
2x2 case, as well as an analogous qualitative statement for real matrices which will be helpful in 
the proof of Theorem [3] 

Lemma 7. The number of matrices in M. 2 (k) with a repeated eigenvalue is <^L £ k 2+£ for every 
e > 0. The probability that a random matrix in M. 2 {[— 1, 1]) has a repeated eigenvalue or is 
singular is 0. 
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Proof. By Lemma [6^ a), the 2x2 matrix ("^) has a double eigenvalue if and only if 46c = 
— (a — d) 2 . For matrices in A^Q— 1, 1]) this is easily seen to be a zero-probability event, as is the 
event that det M = ad — be = 0. 

For matrices in M. 2 {k), we enumerate how many can satisfy 46c = —(a — d) 2 . If a = d then 
there are Ak + 1 choices for b,c E {—k, . . . , k}; otherwise there are at most 2r((a — d) 2 /4) choices 
if a = d (mod 2) and no choices otherwise. (Here r(n) is the number-of-divisors function; the 
factor of 2 comes from the fact that b and c can be positive or negative, while the "at most" is due 
to the fact that not all factorizations of —(a — d) 2 result in two factors not exceeding k.) Therefore 
the number of matrices in M.2{k) with a repeated eigenvalue is at most 

+ + 2 £ £ T ((^>!W« (5) 

\a\<k \a\<k \d\<k ^ ' 

a=d (mod 2) 

where the inequality follows from (a — d) 2 /4 < A; 2 and the well-known fact that r(n) <C e n £ for 
any e > (see for instance [7, p. 56]). □ 

3. Enumeration theorems for integer eigenvalues 
Let //(n) be the Mobius function, characterized by the identity 

Emm = ( 1 ' ifn = 1 



0. if// > I. (6) 



The well-known Dirichlet series identity l/((s) = Y^=\l Ji { n ) n ~ s ^ s valid for 3?s > 1 (see, for 
example, [7, Corollary 1.10]). In particular, n(n)/n 2 = 6/tt 2 , and we can estimate the tail of 
this series (using < 1) to obtain the quantitative estimate 



E 



" M - 6 +0 fi'|. (7) 



d? 7T 2 \k 

d<k v 

Lemma 8. For nonzero integers a, b and parameters k, X, define the function 

N k:X (a, b) = #{(c, d) G Z 2 , c ^ 0, d ^ : \ac + \\, \bc\, \ad\, \bd + A| < k}. (8) 

Then 

\M$(k)\=4j2Kd) J2 N k , x (da,d(3) + 0(k 2 ), 

d<k \<a<l3<k/d 

where the implied constant is independent of A and k. 

Proof. Fix an integer < A < 2k, and let M G Ai^k), so that M — XI is singular. By Lemma |5j 
either at least two entries of M — XI equal zero, or else M — XI has exactly two representations of 
the form ([4]). In the former case, there are 2k + 1 choices for each of the two potentially nonzero 
entries, hence 0(k 2 ) such matrices in total (even taking into account the several different choices of 
which two entries are nonzero). In the latter case, there are exactly two corresponding quadruples 
a, b, c, d of integers as in Lemma [5] Taking into account that each entry of M must be at most k in 



absolute value, we deduce that 

\M x (k)\ = \#{{a,b,c,d) G Z 4 : a,b,c,d^0, 

(a, b) = 1, |ac + A|, |6c|, \ad\, \bd + A| < £;} + 0{k 2 ) 
= \ N ktX (a,b) + 0(k 2 ), 

l<|a|,|6|<fc 
(a,b)=l 

where N k:X (a, b) is defined as above. 

Because of the symmetries N k>x (a, b) = N k ^ x (— a, b) = N kjX (a, —b) = N k:X (—a, —b), we have 

\M x 2 (k)\=2 N ktX (a,b) + 0(k 2 ). 

l<a,b<k 
(a,b)=l 

The only term in the sum where a = b is the term a = b = 1, and for all other terms we can 
invoke the additional symmetry N ktX (a, b) = N kjX (b, a), valid by switching the roles of c and d in 
the definition ([8]) of N kiX (a, b). We obtain 

\M$(k)\=4 J2 N k , x (a,b) + 2N k , x (l,l) + 0(k 2 ) 

l<a<b<k 
(o,6)=l 



4 ^ iY M (a,6) + 0(A; 2 ), 



l<a<f><fc 
(a,6)=l 

where the last step used the fact that N k>x (l, 1) < #{(c, d) G Z 2 : |c|, |d| < k} < A; 2 . 

Using the characteristic property of the Mobius function (|6]), we can write the last expression as 

\M x 2 (k)\=A X ktX (a,b) Kd) + 0(k 2 ) 

l<a<b<k d\(a,b) 

= 4X>(<0 ^ iV M (a,6) + 0(£; 2 ) 

cZ<fc l<a<6<fc 
d|a, d|6 

= 4^M^) N k , x (da,df3) + 0(k 2 ), 

d<k \<a<l3<k/d 

as claimed. □ 

Lemma 9. Le£ and A £>e integers with < X < 2k, and let x and y be integers with 1 < x < 
y < k. Then 

N k>x (x,y) = k 2 C(^x,y)D( x k ;x,y)+0(^), 

where 



C(5;x,y) = max {0,mm + J,^}}, 

D(6 ] x,y)=mm{ 1 -*+ 1 -, 2 -}. (9) 
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Proof. We have 

N k ,x(x,y) = #{(c,d) G Z 2 , c 7^ 0, d ^ : |xc + A|, |yc|, + A| < A;} 

= #{c G Z, c ^ : -k < xc + A < k, -k < yc < k} 

x #{d g Z, d ^ : -k < xd < k, -k < yd + A < k}. 

Since x and y are positive, we can rewrite this product as 

N k , x (x, y) = #{ceZ,c^0: (-k - X)/x < c < {k - X)/x, -k/y < c < k/y} 

x #{d G Z, d ± : -k/x <d< k/x, {-k - X)/y < d < (k - X)/y} 

= #{c G Z, c ^ : -A;/y < c < min{(£; - A)/ar, fc/y}} (10) 

x #{d G Z, d + : max{-/c/x, {-k - A)/y} < d < (k - X)/y}, 

where we have used A > and x < y to slightly simplify the inequalities. The first factor on the 
right-hand side of equation ( fT0| ) is 

min{(fc- X)/x,k/y} - (-k/y) + 0(1) = fcmin{(l - + 1/y, 2/y} +0(1) 

if this expression is positive, and otherwise; it is thus precisely kC(^; x, y) + 0(1). Similarly, 
the second factor on the right-hand side of equation ( fTOj ) is 

(k - X)/y - max{-A;/x, {-k - X)/y} + 0(1) = k min{(l - $)/y + 1/x, 2/y) + 0(1) 

(note that this expression is always positive under the hypotheses of the lemma), which is simply 
kD(^; x, y) + 0(1). Multiplying these two factors yields 

N k , x (x, y) = k 2 C(l ■ x, y)D(l ; x, y) + k ■ 0{C(\ ; x, y) + £>(£ ; x, y)) + 0(1). 

The lemma follows upon noting that both 0(|; x, y) and -D(f ; x, y) are <C 1/y by definition, so 
that the second summand becomes simply O(^), and the 0(1) term may be subsumed into O(^) 
since y < k. □ 

We have already used the trivial estimate 

1 = (U-L) + 0(1), 

L<a<U 

provided < L < U. We will also use, without further comment, the estimates 

E ^ = 1 °gZ + (i); ^particular, ^ I = log C/ + O(l); 

L<a<U l<a<U 

and 

E ^ = Z-Z7 + °(^); m particular, £ £ = 0(4). 

L<a<U L<a<U 

These estimates (also valid for < L < U) follow readily from comparison to the integrals 
J^l/x dx and 1/x 2 dx. 

Most of the technical work in proving Theorem[T]lies in establishing an estimate on a sum of the 
form ^2i< a< gC(6; a i P)D(8; a, (3) for a fixed (3. The following proposition provides an asymp- 
totic formula for this sum; we defer the proof until the next section. Assuming this proposition, 
though, we can complete the proof of Theorem [T] as well as Corollary [2] 



Proposition 10. Let (3 > 1 and < 5 < 2 be real numbers, and let C and D be the functions 
defined in equation ([9]). Then 

l<a</3 ^ VP/ 

where V(5) was defined in equation ([T]). 

Proof of Theorem [7] assuming Proposition |70] The functions C and D defined in equation ([9]) are 
homogeneous of degree —1 in the variables x and y, so that Lemma [9] implies 

N k>x (da,d(3) = ^(f;a,/3)D(|;a^) + 0(|). 
Inserting this formula into the conclusion of Lemma [8] yields 



)■ 



\Mm-^J2^ E c«;«.^(i;«^) + o(x; E 

We bound the first error term by summing over 1 < a < (3 to obtain 

d<k l<a</3<k/d r d<k l</3<k/d d<k 

so that we have the estimate 

\M x 2 (k)\=^J2^T £ C(J; a, /?) D(£ ; a, /?) + 0(k 2 ). (11) 

rf<fc l<a</3<k/d 

We now apply Proposition [T0| to obtain 



d<fe l<P<k/d V ^ \ P / / 

= 4fc2 E # (^( A A) ( log ^ + o(i)) + o(i)) + o(e) 
d<k ^ ^ ' ' 

= «• (v< A/t) log * E + o ( E 1") ) + 0(*») 



4fc 2 ( V(X/k) log A: ( ^ + O Q ) ) + O(l) ) + 0(fc^ 



r2 



7T Z 



-£; 2 logfc + 0(fc 2 



where we have used equation ([7]) and the fact that 1/n 2 and Y^(log n)/n 2 are convergent (so the 
partial sums are uniformly bounded). □ 

Proof of Corollary^from Theorem^ Note that for any M e A^fc), if one eigenvalue is an inte- 
ger then they both are (since the trace of M is an integer). Thus if we add up the cardinalities of all 



of the Ai^ik), we get twice the cardinality of .Mf (k), except that matrices with repeated eigen- 
values only get counted once. However, the number of such matrices is <C e k 2+e by Lemma |7j 
Therefore 



2\Ml{k)\=Y,\M x 2 {k)\+0 £ {k 2+ -< 

E ( 2W ^J k) k 2 \ogk + 0{k 2 )\+0 £ {k^ 



-2k<\<2k 
„3 



24fc 3 logfc V(X/k) 



7T 2 ^—f k 

-2k<\<2k 



The sum is a Riemann sum of a function of bounded variation, so this becomes 

2\M*(k)\ = ( £ V(«5) d* + 0(1)) + 0(fc 3 ). 

The corollary then follows from the straightforward computation of the integral J 2 2 V{5) d5 = 
1(7^2 + 4 + 3 \og(y/2 + 1)), noting that log(>/2 - 1) = - log(>/2 + 1). □ 



4. Proving the key proposition 

It remains to prove Proposition [TOj Recalling that the functions C and D defined in equation (|9j) 
are formed by combinations of minima and maxima, we need to separate our arguments into several 
cases depending on the range of 5. The following lemma addresses a sum that occurs in two of 
these cases (0 < 5 < 1 and 1 < 5 < y/2). Note that because of the presence of terms like log(5 — 1) 
in the formula for V(5), we need to exercise some caution near 5=1. 

Lemma 11. Let (5 > 1 and < 5 < v2 be real numbers, with 5^1. Then 



E ( a + /?)/? 



max{l,|l-(5|/3}<a<(l+(5)- 1 /3 



2 ( 1 n jti m m n r 2 A , + log/3 

' ' A (1 - 5) log |1 - 5 2 \ + O' 



- 



p\i + 5 1 1 v ' 01 7 v 



Proof. Suppose first that |1 — 5\(3 > 1. Then the sum in question is 
2(1-*) v 1 2 v 

|l-5|/3<a<(l+<5)- 1 /3 |l-<5|/3<a<(l+5)- 1 /3 



+ - |i - + 0(1) 



(3 2 \1 + 5 

K^-ll-^l-d-^logll-^+ofl 
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which establishes the lemma in this case. On the other hand, if |1 — 5\(3 < 1 then the sum in 
question is 

2(1-*) V 12 v 
^ a 2 

l<a<(l+<5)- 1 /3 l<a<(l+5)- 1 /3 



(3 \ & 1 + 5 v 'J 2 \l + 6 

2 ( 1 (i-.)io g( i + ^ + o^ ■ }1 - s ^< 3 



We subtract 2 ( 1 1 — 5 1 + (1 — 8) log 1 1 — 8\ ) //3 from the main term and compensate in the error term 
to obtain 

2(1-*) V 12 v 
^ a 2 ^ 

1<q<(1+5)- 1 /3 l<a<(l+<5)- 1 /3 

= !(r^-ii-*i-a-*)iog|i-* 2 

/l |1 - 5|log/? |l-^| + |l-<i|log|l-<i|-') \ 

-*( i _| 1 _ i |_ (1 _ i)log | 1 _^ + ofHi^ + ' 1 -^ og ' 1 -^ 



/3\l + 5 1 1 v ' 01 J \ P* 

since we are working with the assumption that |1 — 5| < 1//3. Because the function tlogt^ 1 is 
increasing on the interval (0, 1/e) and bounded on the interval (0, 1], we have \1 — S\ log |1 — 5| _1 < 
{1/0) log/? if > e and \l-5\ log 1 1 — 5| 1 < 1 < if 1 < < e. In either case, the last error 
term can be simplified to 0{{1 + log 0) / 2 ), which establishes the lemma in second case. □ 

Proof of Proposition 10 We consider separately the four cases corresponding to the different parts 
of the definition {]} of V{5). 

• Case 1: < 8 < 1. In this case we have 0<1 — 5 < {1 + 5) _1 < 1 and 

nt* r\ Sh if a < {1-6)0, Jf, if a < {1 + 5)^0, 

C{5;a,0) = ->P and D{5; a, 0) = \ p 1 _* l ■ - , ^-i* ^ 

(V + f> if {I - 8)0 < a + lf ( 1 + 5 ) @< a - 

Therefore 

Y, C(S;a,p)D(S;a,f))= V |.| + V feM)§ 

l<a</3 1<q<(1-5)/3 M M max{l,(l-<5)/3}<a<(l+<5)- 1 /3 V 



, a I \ a , 

(The first sum might be empty, but this does not invalidate the argument that follows.) The first 
sum is simply 

i E ,-^-^on-^o{^. 





l<a<(l-<5)/3 
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By Lemma 1 1 , the second sum is 

( 1-5 1\2 
^ 1 a + PJP 



max{l,(l-5) / 3}<a<(l+5)- 1 /3 



1 + 5-l-(l-5)lo g (l-5>)) + o( 1 + l °^ 



P\l + 5 ' ov J \ P 

while the third sum is 

^ (1-5 l\(l-5 1 

(l+5)-i/3< a </3 V y/ V y 

M „ ^ 1 5 2 - 2(5 + 2 ^ 11-5 ^ 

= (1 - 5) E ^ + — p — E « + ^ E 1 

(l+<5)- 1 /3<a</3 ^ (1+5)- 1 /3<q</3 ^ (l+5)" 1 /3<a</3 

+ 5 1 ^f 1 \\ 5 2 - 25 + 2 (^ /3 /l 

= ^ ( 2 - * 2 - db + ^ ~ 2S + 2) log(1 + S) ) + (w 1 ■ <13) 



This case of the proposition then follows from equation ( [12] ), on noting that 

4-45+ - + 25-2-2(1-5) log(l - 5 2 ) + 2 - 5 2 - - + (5 2 - 25 + 2) log(l + 5) 

1+5 1+5 

= 4 - 25 - 5 2 + 5 2 log(l + 5)- 2(1 - 5) log(l - 5). 

• Case 2: 5 = 1. In this case we have 

C(6]a,p ) = -= and D(6;a,P ) = < 1/ „ , 

p I 1/a, if p/2 < a. 



Therefore 



l<a</3 l<a</3/2 ^ " (3/2<a</3 " 



l + log2 +0 / 1 



(3 \(3 
as desired. 

• Case 3: 1 < 5 < \[2. In this case we have 
,,,, ,, ,0, ifa<(i-l)/3, J§, if Q < (<J + 
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Therefore 



C(5;a,P)D(S;a,P)= ]T 



l<a</3 



max{l,(5-l)/3}<a<(<5+l)- 1 /3 



1-5 l\ 2 



+ E 



(5+l)" 1 /3<a</3 



— + + « ] (14) 



(We note that (5 — i)(3 < (8 + 1) 1 /3 for 5 between 1 and y/2. For very small (3 we might have 
1 > (5 + l) _1 /3, in which case the first sum is empty, but that does not invalidate the argument that 



follows.) By Lemma 1 1 the first sum is 



E 



max{l,(5-l) / 3}<a<(l+5)- 1 / 3 



1-5 1\ 2 

« + ft) ]) 



while the second sum has already been evaluated in equation ( |13| ) above. This case of the proposi- 
tion then follows from equation (fT4]), on noting that 



~ +2-25-2(1-5) log(5 2 - 1) + 2 - 5 2 - + (5 2 - 25 + 2) log(5 + 1 



1 + 5 



5 + 1 

4 - 25 - 5 2 + 5 2 log(5 + 1) + 2(5 - 1) log(5 - 1). 



Case 4: a/2 < 5 < 2. Just as in Case 3, we have 



+ if(<5 -!)/?< a " l^ + i if(«+l)-'/3<a. 



However, the inequality (5 — \)(3 < a automatically implies that (5 + 1) 1 (3 < a when 5 > \[2. 
Therefore 



£ C(S;a,f))D(6; a ,l3)= £ + (Izi + I) 
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(In this case we will not need to use the more precise lower bound max{l, (8 — < a.) This 
yields 

£ C(5;a,P)D(S;a,P) 

l<a</3 

m ss V- 1 5 2 -25 + 2 ^ 1 1-5 ^ , 

= (1 - 5) D ^ + — D a + H^ E 1 

{8-\)P<a<p (8-l)/3<a</3 (6-l)P<a<P 



5 2 -25 + 2/ /? ./ 1 \\ 1-5, , , , ,, 

= i (> - 25 - (<5 2 -26 + 2) \og(5 -l)\+0 (± 
where the error terms have been simplified since 5 — 1 is bounded away from 0. 



This ends the proof of Proposition 10 □ 

5. Distribution of real eigenvalues 
In proving Theorem |3} it will be convenient to define the odd function 

G(z)= [ - log \t\ dt = z(l- log \z\), (15) 



whose relevance is demonstrated by the following lemma. 

Lemma 12. If B and C are independent random variables uniformly distributed on [—1, 1], then 
the product BC is a random variable whose distribution function is Fbc{z) — Pt(BC < z) = 
l(l + G(z))forze [-1,1]. 

Of course, for z < —1 we have F BC (z) = 0, and likewise F BC (z) = 1 for z > 1. 

Proof. Note that \B\ and \C\ are uniformly distributed on [0, 1]. For < z < 1, we easily check 
that Px(\BC\ < z) = min{l, z/s} ds = G(z). Thus \BC\ is distributed on [0, 1] with density 
f\BC\( z ) — — log 2, and by symmetry BC has density fsciz) — — § log \z\ on [— 1, 1]. The lemma 
follows upon computing F B c{z) = /bc(s) ds. □ 

It will also be helpful to define the following functions, which are symmetric in x and y: 

ux(x, y) = 1/2 + G{xy)/2 + G((x - yf/A), (16) 
u 2 (x,y) = l/2-G(xy)/2, (17) 

'ui(x,y), if xy < 1 and x + y < 0, 

v i (%,y), if < 1 and x + y > 0, 



u(x,y) 



(18) 

1 + G((x - 2/) 2 /4), if > 1 and x + y < 0, 
0, otherwise. 



To prove Theorem[3j we first consider the distribution function Fw (5) = f t<s W{t) dt associated 
to the density W{8). For a random matrix M in A^Q— 1, 1]) and a real number <5, we will derive 
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an expression for the expected number of real eigenvalues of M falling below 5, then differentiate 
it to obtain W{5). 

Since the set .M2Q— 1? 1]) is closed under negation, it is clear that W(— 5) = W(S), so it suffices 
to compute W(S) for 5 E [0,2]. It turns out that our calculations for F w will be somewhat 
simplified by considering F w (— 5) rather than F W (S). 

Proposition 13. We have 

-1+5 r l+5 



Fw(-8) = -7 / v(x,y)dxdy 

4 J -1+8 J -1+5 



for all < 5 < 2, where v is defined in equation (18). 



Proof. Let us denote the entries of M by the random variables A, B, C, D, which by assumption 
are independent and uniformly distributed in [—1, 1]. Let 5 be fixed in the range [0, 2], and consider 
the shifted matrix M' = M + SI, which we write as 



M' 



X B 
C Y 



where B, C are as before, and X, Y range independently and uniformly in [—1 + S, 1 + §}. 
Clearly the eigenvalues of M less than —5 correspond to the negative (real) eigenvalues of M' . By 
Lemma[7j we are free to exclude the null set where M ' is singular or has repeated eigenvalues. Out- 
side of this null set, M' has exactly one negative eigenvalue if and only if det M' = XY — BC < 0, 
by Lemma|6fc). Likewise by Lemma|6jd), M' has exactly two negative eigenvalues if and only if 
XY - BC > and X + Y < and disc M' = (X — Y f + ABC > 0. We thus have: 

F w (-5) = Pr(BC > XY) + 2Pr(X + Y < and -(X - F) 2 /4 < BC < XY). 

We may express this probability as the average value 



-1+5 i-l+S 

F w (-5) = -l j p(x,y)dxdy, 

'-1+5 J -1+5 



4 



where for fixed x and y, 

p(x, y) = Pt(BC > xy) + 2Pr(x + y < and - (x - yf/A < BC < xy) 

= Pi(BC > xy) + 2Pr(-(x - y) 2 /A < BC < xy)l{x + y < 0} (19) 

(here 1{-} denotes the indicator function of the indicated relation). To complete the proof it suffices 
to show that p equals the function v defined in equation <|T~8]). 



The probabilities appearing in equation { fT9| ) are effectively given by Lemma 12 However, there 
is some case-checking involved in applying this lemma, since the value of, say, Pr(£>C > xy) = 
1 — Fbc( x u) depends on whether xy < —1, — 1 < xy < 1, or xy < —1. We make some 
observations to reduce the number of cases we need to examine. 

Note that (x—y) 2 /A is bounded between and 1 for any x,y G [—1+5, 1+5], so that —(x—y) 2 /A 



always lies in the interval [—1,1] prescribed by Lemma 12 From the identity (x + y) 2 — (x — y) 2 = 
Axy we see also that xy > —(x—y) 2 /A. Thus xy is never lower than —1, and we need only consider 
whether xy > 1 (in which case FBc(xy) = 1). We therefore have 

Pt(BC >xy) = l- F BC (xy) = |(1 - G(xy))l{xy < 1} 
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and 

2P r (_( x _ yf/A <BC< xy) = 2F BC (xy) - 2F BC (-(x - yf/A) 

= l{xy > 1} + G(xy)l{xy < 1} + G{(x - yf/A). 
Inserting these two evaluations into the formula ( [19] ), we obtain 

p(x,y) = §(1 - G(xy))l{xy < 1} 

+ (l{xy > 1} + G(xy)l{xy < 1} + G((x - y) 2 /A))l{x + y < 0}. 

It can be verified that this last expression is indeed equal to the right-hand side of the definition ([T8 
of v. 



□ 



Since W{5) = W{—$) = — j^F w (—5), to finish the proof of Theorem |3j it therefore suffices to 
prove that —-^Fw(—5) equals the formula given in equation (|3]). 



6. The derivative of the distribution 



Proposition 13 expresses F w (5) as an integral, of a function v that is independent of 5, over the 
square S$ = [— 1 + 5, 1 + 5} 2 . Since the region S$ varies continuously with 5, we can compute the 
derivative —^F w (—5) by an appropriate line integral around the boundary of Sg. Indeed, by the 
fundamental theorem of calculus, we have 

I d f f 1+s f 1+s 

' S) = --^(J- 1+ sJ- M ^ y)dXdV 

-1+6 



d F ( 
-d~5 Fwi 



Y / rl+S rl+S 

- / v(l + 5,y)dy- / v (-l + 6,y)dy 

4 V J -1+6 J -1+6 



1+6 



1+6 



u(x, 1 + S) dx 



1+6 



1+6 



u(x, —1 + 5) dx 



1+6 



1 



u(x, — 1 + 5) dx — - J u(x,l + 5)dx, 



1+6 



(20) 



'-1+6 * J -1+5 

where we have used the symmetry v(x, y) = v(y, x) to reduce the integral to just the top and 
bottom edges of S$ (where y = 1 + S and y = —1 + 5, respectively). 



The evaluation of pO) divides into three cases depending on the behaviour of the indicator 
functions l{x + y < 0} and l{xy < 1} on the boundary of S$ (see Figure[5]). 
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Figure 2. The three cases: < 5 < 1, and 1 < 5 < y/2, and y/2 < 5 < 2 
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• Case 1: < 5 < 1. For this range of 5, the line x + y = intersects the bottom edge of Ss 
at x = 1 — 5, while the hyperbola xy = 1 intersects the top edge at x = (1 + 5) _1 . Thus by the 
definition of v, equation ( [20] ) becomes 



1-5 



-1+5 



1+5 



vi(x, — 1 + 5) dx+ / v 2 (ar, —1 + 5) dx 



1-5 



^(x, 1 + 5) dx 



1+5 



The following elementary antiderivatives, which are readily obtained by substitution and inte- 
gration by parts, follow for any fixed nonzero real number y from the definitions ( [T5] ), ( [To] ), and ( [T7| ) 

of G, z/ 1? and z/ 2 : 

/*<*,,) * = |. + |A(3 - 21og M) + - yft* - 61og |(, - y )/2|), 

i/ 2 (x, y) = |x — ^x 2 y(3 — 2 log \xy\). (21) 



Therefore in this case 

- 1 



\x 



\x 2 { 



^ l + 5)(3-21og|x(-l + 5)|) 
+ + l -5) 3 (5-61og|(x + l -5)/2|) 
+ (|x - \x\-l + 5)(3 - 21og + 5)|)) 

- (fx- §x 2 (l + 5)(3-21og|x(l + 5)|)) 

x=— 1+5 

80 + 205 + 905 2 + 525 3 - 1075 4 (5 - 75 + 85 2 ) (1 - 5) 



1-5 

x-=-l+5 



1+5 

x=l-S 
(1+5)- 1 ' 



144(1 + 5) 
5(1 -5 2 ) 



12 



log(l - 5) 



4 



log(l + 5) 



(after some algebraic simplification), which verifies the first case of Theorem [3} (Note that the 
integrands really are continuous, despite terms that look like log 0, because the function G is con- 
tinuous at 0; hence evaluating the integrals by antiderivatives is valid.) 

• Case 2: 1 < 5 < y/2. Now, the line x+y = does not intersect Ss, while the hyperbola xy =1 
intersects the top edge at x = (1 + 5) _1 . Thus by the definition of v and the antiderivative pTj ) of 
z/ 2 , equation ([20]) becomes 



d 

d~5 



1 w\ 



S) 



If f 1+s [ {l 
-I J u 2 (x, -1 + 5) dx - J 



(1+5)- 1 



u 2 (x, 1 + 5) dx 



(fx - §x 2 (-l + 5)(3 - 21og \x{-l + 5)|)) 



1+5 



- (§x - §x 2 (l + 5)(3 - 21og |x(l + 5)|)) 
5(20 + 105 - 125 2 - 35 3 ) (35 -l)(5-l) 



x=-l+8 
(1+5)- 1 ' 



-1+5 



16(1 + 5) 



log(5-l) + ^' 1) log(5 + l) ! 
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which verifies the second case of Theorem [3] 

• Case 3: < 5 < 2. As before, the line x + y = does not intersect Sg, while the 
hyperbola xy = 1 intersects the bottom edge at x = (8 — l) -1 . Thus by the definition of v and the 
antiderivative pTj ) of z/ 2 , equation ( |20| ) becomes 

-^M-^)=2 / z/ 2 (x,-l+5)rfa; = -(|a;-|a; 2 (l + 5)(3-21og|x(H 

5(5-2)(2-65 + 35 2 ) (<5-l) 3 



=-1+5 

log(<5-l), 



16(5-1) 4 
which verifies the third case of Theorem |3] 

Since the last case of Theorem [3] is a consequence of Lemma |4} the proof of the theorem is 
complete. 

Remark. One could also use the same method to extract the individual distributions of the greater 
and lesser eigenvalues of M: for instance, eliminating the factor of 2 from equation ( [19] ) would 
yield an expression for the distribution of just the lesser eigenvalue of M. 
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